Team Optimal Decentralized Control of System with Partially Exchangeable Agents-Part 1: Linear Quadratic Mean-Field Teams
نویسندگان
چکیده
We consider team optimal control of decentralized systems with linear dynamics and quadratic costs that consist of multiple sub-populations with exchangeable agents (i.e., exchanging two agents within the same sub-population does not affect the dynamics or the cost). Such a system is equivalent to one where the dynamics and costs are coupled across agents through the mean-field (or empirical mean) of the states and actions. Two information structures are investigated. In the first, all agents observe their local state and the mean-field of all sub-populations; in the second, all agents observe their local state but the meanfield of only a subset of the sub-populations. Both information structures are non-classical and not partially nested. Nonetheless, it is shown that linear control strategies are optimal for the first and approximately optimal for the second; the approximation error is inversely proportional to the size of the sub-populations whose mean-fields are not observed. The corresponding gains are determined by the solution of K+1 Riccati equations, where K is the number of sub-populations. The dimensions of the Riccati equations do not depend on the size of the sub-populations; thus the solution complexity is independent of the number of agents. Generalizations to major-minor agents, tracking cost, weighted mean-field, and infinite horizon are provided. The results are illustrated using an example of demand response in smart grids.
منابع مشابه
Linear Quadratic Mean Field Teams: Optimal and Approximately Optimal Decentralized Solutions
We consider team optimal control of decentralized systems with linear dynamics, quadratic costs, and arbitrary disturbance that consist of multiple sub-populations with exchangeable agents (i.e., exchanging two agents within the same sub-population does not affect the dynamics or the cost). Such a system is equivalent to one where the dynamics and costs are coupled across agents through the mea...
متن کاملMutually Quadratically Invariant Information Structures in Two-Team Stochastic Dynamic Games
Abstract—We formulate a two-team linear quadratic stochastic dynamic game featuring two opposing teams each with decentralized information structures. We introduce the concept of mutual quadratic invariance (MQI), which, analogously to quadratic invariance in (single team) decentralized control, defines a class of interacting information structures for the two teams under which optimal linear f...
متن کاملRobust Mean Field Linear-Quadratic-Gaussian Games with Unknown L2-Disturbance
This paper considers a class of mean field linear-quadratic-Gaussian (LQG) games with model uncertainty. The drift term in the dynamics of the agents contains a common unknown function. We take a robust optimization approach where a representative agent in the limiting model views the drift uncertainty as an adversarial player. By including the mean field dynamics in an augmented state space, w...
متن کاملCentralized Versus Decentralized Team Games of Distributed Stochastic Differential Decision Systems with Noiseless Information Structures-Part II: Applications
In this second part of our two-part paper, we invoke the stochastic maximum principle, conditional Hamiltonian and the coupled backward-forward stochastic differential equations of the first part [1] to derive team optimal decentralized strategies for distributed stochastic differential systems with noiseless information structures. We present examples of such team games of nonlinear as well as...
متن کاملDecentralized Optimal Control for the Mean Field Lqg Problem of Multi-agent Systems
This paper investigates the decentralized optimal control of the linear quadratic Gaussian (LQG) problem in discrete-time stochastic multi-agent systems. The state equations of the subsystems are uncoupled and the individual cost function is coupled with the states of other agents. With the help of state aggregation technique and the mean field structure, we get the decentralized optimal contro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1609.00056 شماره
صفحات -
تاریخ انتشار 2016